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SYSTEM MTO METHOD FOR PERFORMING PRE-EMPTIVE PROTECTION 

SWITCHING 

Field of the Invention 

The invention relates to communications network 
protection switching. 

Background of the Invention 

In the event there is a network failure which causes 
a service being provided on a first path through the network to 
go down, it is common to invoke a protection switching 
mechanism whereby the service is switched over to another path. 

Various approaches exist, such as 1+1 protection, 
where data is sent over both a main path and a second path at 
all times. This provides low data loss, but requires 
approximately twice the network resources for each service. 

Another approach is 1:1 protection, where a second 
path is configured, and in which low priority traffic may use 
the second path until it is required, at which point the low 
priority traffic is pre-empted. This provides low data loss. 
Its use of network resources depends upon the availability of 
low priority traffic. Also the pre-emption of the low priority 
traffic is an undesired aspect. 

Yet another approach is to employ routing table 
updates, where a protocol like OSPF is used to update routing 
tables upon a failure. This provides good network utilization, 
but there is a large data loss as the routing tables across the 
network are updated. 

Another solution involves reconnection, where a new 
path is calculated and set up after the failure occurs. Again, 
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this provides good network utilization, but there is 
substantial data loss as the new path is calculated and set up. 

It is common to perform BER measurement in SONET 
networks, see for example U.S. patent No. 5,764,651 which 
teaches the generation of signal degrade conditions when the 
BER becomes a certain value, and signal fail if the BER is even 
worse than that. However, the detection of these conditions 
has only been used to trigger the above discussed protection 
mechanisms. The signal degrade conditions are sent to a 
network management platform where a warning alarm might be 
raised. 

In U.S, patent No. 6,141,532, a system of deciding 
whether to connect a call through a primary or backup 
transmission line is disclosed. The reference teaches 
analyzing the quality of a transmission line by looking at C/N 
and BER and sending this to a forecaster. The forecaster 
decides the likelihood that a failure is in the process of 
occurring within some predetermined time, such as the ''mean 
call holding time'' . If a failure is significantly likely, the 
call is connected through a backup transmission line, and 
otherwise, the call is connected through the primary 
transmission line. The reference does not deal with performing 
protection switching after a call is connected and is ongoing. 

It is noted that it is common to employ FEC (forward 
error correction) to allow a certain number of errors to be 
corrected. Because of this, the raw BER can increase to a 
certain value with no impact on the error rate in the corrected 
bit stream, i.e. the corrected BER. However, once the BER 
reaches this certain value, sometimes referred to as the 
"coding correction cliff", incorrect decoding results in a 
large increase in the corrected BER. The correction 
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capabilities of FEC codes decrease by several orders of 
magnitude once the raw BER reaches this value. 

Summary of the Invention 

Methods and network nodes are provided which are 
5 adapted to perform protection switching on the basis of raw 
signal quality information, such as raw BER information, in a 
manner which instigates the protection switching before an 
actual failure has occurred. 

In some embodiments, these methods leverage forward 
10 error correction and uncorrected BER to effect traffic 

redirection before faults are observed at layer 2 and above. 
This is as opposed to the method of switching layer 2 traffic 



ft] in response to bitstream characteristics observed at that 

layer. 

M 15 According to a first broad aspect, the invention 

FlJ: 

fu provides a method of performing protection switching in a 

communications network. The method involves a) on an ongoing 

Q- 

M basis, monitoring a raw quality measure in respect of a first 

path through the communications network; b) on an ongoing 

2 0 basis, deciding on the basis of the quality measure whether a 

failure on the first path is likely to occur in the immediate 
future; c) after deciding a failure is likely to occur in the 
immediate future but before occurrence of a failure, 
instigating a switch to a protection path through the network. 
25 The quality measure may for example be a raw bit error rate 
(BER) . 

The invention may be applied in the context of 
various networks, for example an optical network, such as a WDM 
optical network in which case the first path is a wavelength 

3 0 channel through an optical network. 
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The decision of whether a failure is likely can be 
made using any suitable technique. For example, it can be as 
simple as determining if the quality measure has passed some 
threshold. It might require the quality measure to exceed two 
thresholds in some short time period. It might be based on the 
rate of change between consecutive measurements. 

Depending upon the rate at which the protection 
switch can be completed, in some embodiments the complete 
switch can be done prior to failure of the first path. 

In some embodiments, instigating a switch to a 
protection path through the network is done for higher priority 
traffic before being done for lower priority traffic. 

In some embodiments, the method further involves 
making connection routing decisions for new connection requests 
taking into consideration raw bit error rates collected for the 
network in a manner which encourages the use of links/paths 
with good raw BER over links/paths with poor raw BER. 

Another broad aspect of the Invention provides a 
method of performing protection switching in an optical 
communications network. The method involves a) on an ongoing 
basis, monitoring a raw quality measure in respect of a first 
light path between components in an optical communications 
network; b) on an ongoing basis, deciding on the basis of the 
raw quality measure whether a failure on the first light path 
is likely to occur in the immediate future; c) after deciding a 
failure is likely to occur in the immediate future but before 
occurrence of a failure, instigating a switch from a first path 
using said first light path to a second path not using said 
first light path, optionally switching at least one service 
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from the first link to the protection link in a sequence based 
on priority of the services. 

Another broad aspect of the invention provides a 
network node having an input for receiving on an ongoing basis 
raw BER measurements in respect of a path through a network of 
which the network node forms a part, and having decision means 
adapted to, on an ongoing basis, decide on the basis of the raw 
BER measurements whether a failure on the path is likely to 
occur in the immediate future, and after deciding a failure is 
likely to occur in the immediate future but before occurrence 
of the failure to instigate a switch to a protection path 
through the network. 

In some embodiments, the network node is adapted for 
use in an optical network, wherein the first path is a 
wavelength channel through an optical network. 

Brief Description of the Drawings 

Preferred embodiments of the invention will now be 
described with reference to the attached drawings in which: 

Figure 1 is a schematic diagram of an example network 
within which the protection switching methods provided by the 
invention may be employed; 

Figure 2 is a block diagram of a network node of 
Figure 1 according to an embodiment of the invention; 

Figures 3 and 4 are plots of corrected throughput and 
input BER for an example failure scenario; 

Figure 5 is an example of a network in which 
prioritized re-routing has been performed; and 
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Figure 6 is a block diagram of a system in which raw 
BER information is used to make routing decisions at connection 
setup - 

Detailed Description of the Preferred Embodiments 

A failure in a telecommunications network generally 
happens over a very short time period. Although this time is 
short, it is not instantaneous. For example, consider a 
backhoe cutting a fiber optics cable. As the backhoe comes 
into contact with the sheathing around the cable, the fiber 
inside it is bent, in an increasingly sharp manner, until it is 
broken. Even if the backhoe is moving quickly, this process 
will take a number of milliseconds. 

As the fiber is bent, the light signal degrades, 
resulting in an increasing number of bit errors. This bit 
error rate (BER) can be monitored by network equipment to which 
the fiber is connected. 

An example network is shown in Figure 1. Shown is a 
WDM system having five optical networking nodes Nl, N2, N3, N4 
and N5 interconnected by optical fiber in a ring configuration 
generally indicated by 10. In a WDM (wavelength division 
multiplexing) system, signals are sent over the fiber ring 10 
using multiple wavelengths. In Figure 1 three wavelengths A-i, 
^2 and Xz are shown. A ''service" between Nl and N3 delivers 
data between those two nodes. This might employ, for example, 
a primary path between Nl and N3 generally indicated by 12 
which might for example employ Xi. A protection path 14 is also 
shown which might use the same or some other wavelength. The 
protection path 14 may be a dedicated protection path, or a 
shared protection path to be made available for multiple users 
for example. A ^^path" for this optical networking example is 
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an end-to-end connection. This might involve one or more light 
paths. A light path is part or all of a path, with electrical 
optical (EO) conversion taking place at one end of the light 
path and optical electrical (OE) conversion taking place at the 
other end of the light path. Light paths may also be referred 
to as '"hops". Finally, an optical connection between adjacent 
optical components at least one of which does not perform an OE 
or EO conversion will be referred to as a light section. 

In the example of Figure 1, path 12 has a single 
light path, since the signal is not dropped at node N2 . 

Protection may be available at the path level or at 
the light path level. 

The networking nodes Nl, N2, N3, N4 and N5 may 
perform add/drop multiplexing. For example, for a primary path 
between Nl and N3 on a particular wavelength A,i, the primary 
path goes through N2 unaffected and is dropped at N3 . 

Data is transmitted through the system in a 
packetized format with a redundancy scheme such that some error 
correction can be performed at the receiver. For example a 
digital wrapper may be used that supports FEC (forward error 
correction) . Through the use of such FEC, bit errors can be 
detected and corrected, up to a certain BER threshold with the 
result that throughput of the actual data is largely unaffected 
for a transmission BER up to that BER threshold. Above the BER 
threshold, the errors can no longer be corrected, and the 
service provided by the signal is affected. In conventional 
systems which employ this corrected BER to perform restoration 
switching, there is a delay in the time taken to respond to a 
fault until the fault is detectable at a higher layer, such as 
layer 2 for example. 
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Typically, corrected BER is determined at points 
where optical-electrical conversion are performed, i.e. for 
each light path. 

Restoration provides the ability to reroute a service 
5 around a network failure. Previously, this was done once the 
service had completely failed as determined by examining the 
corrected BER. According to the invention, the restoration 
process is started before the errors become uncorrectable, and 
preferably a complete switch to the protection path through the 
10 network is made without affecting the service's data stream. 

In some embodiments where it is appropriate to think of layered 
2 protocol stacks, these methods leverage forward error 

nJ correction and uncorrected BER to effect traffic redirection 

before faults are observed at layer 2 and above. This is as 
5fj 15 opposed to the method of switching layer 2 traffic in response 



to bitstream characteristics observed at that layer. 



Referring now to Figure 2, shown is a network node 
yl adapted to initiate restoration in this manner. Any number of 

C^. the nodes Nl, N2, N3, N4 and N5 of Figure 1 might be 

2 0 implemented in this manner. The network node has a quality 
determination block 20 which estimates, measures or otherwise 
determines a raw quality measure somehow associated with 
received signal 24 . In a preferred embodiment and the 
remainder of this description, this quality measure is assumed 

2 5 to be a BER (bit error rate) , but this need not necessarily be 

the case. It is important that whatever the raw quality 
measure is, it can be determined in real time from the received 
signal. Typically the raw quality measure is determined for 
each wavelength since a given service will typically use a 

3 0 channel consisting of a single wavelength. 
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The raw quality measure may be an end-to-end quality 
measure for an entire path. Alternatively, the raw quality 
measure for a given path will be the raw quality measures of 
each light path making up the path. 

In some embodiments, the raw quality measure, 
preferably the raw BER is advertised as a state metric for each 
path and/or light path by one or more network nodes having 
knowledge of this state. This state metric information would 
be received the network node of Figure 2 and coordinated with a 
given service for which protection switching decisions are then 
made. Then, whatever nodes are responsible for making 
protection switching decisions can make their decisions on the 
basis of this advertised information. 

Also shown is a failure predictor block 22 which 
processes the raw quality measures from the quality 
determination block 20. The failure predictor block 22 makes a 
decision as to whether recent quality measures are indicative 
of a pending failure of the particular channel. The failure 
predictor block 22 makes this decision prior to the actual 
failure of the channel, during the period in which bit errors 
are still correctable by the error correction scheme. In the 
event the decision is made that a failure is likely, an 
''initiate protection switch" signal 26 is generated by the 
failure predictor block 22. For the purposes of this 
invention, this signal is symbolic of the decision that a 
failure is likely to occur in the immediate future. Such a 
signal may not necessarily be generated if protection switching 
can be effected locally* 

Figure 3 shows an illustrative failure scenario 
timeline with an input BER 30, and output corrected throughput 
32 (the scale values on the axes are for illustrative purposes 
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only) . The network failure starts to occur at time tO at which 
time the input BER 30 starts to increase. Initially, the FEC 
is capable of correcting these errors, and so the corrected 
throughput 32 remains unaffected. However, when the BER does 
reach a threshold, IBf, approximately 0.18 in this example, the 
FEC can no longer correct the errors, and so the corrected 
throughput 32 drops. Typically, once the BER exceeds the 
threshold, the corrected throughput 32 will drop very quickly 
with increasing BER. 

According to one embodiment of the invention in order 
to minimize data loss, restoration is started before the BER 
reaches IBf, for example when the BER reaches a smaller 
threshold IBr. 

In one embodiment, the following protection scheme is 
employed, this being shown by way of example in Figure 4 where 
tn corresponds to a time value on the horizontal axis: 

tO: network is functioning without problem. 

tl: network failure starts, input BER starts to 
increase. FEC corrects errors. 

t3 : input BER reaches IBr and/or failure predictor 
decides failure likely to occur. Restoration starts. FEC 
continues to correct errors. 

between t3 and t4 : service starts using the 
protection path with minimal and preferably no data loss. 
BER returns to nominal small value. The BER never reaches 
IBf. 

The mechanics of how the protection switchover is 
implemented are not essential to the invention. Preferably, the 
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switchover is done quick enough that data is flowing on the new 
path before the failure occurs. However, if the primary path 
fails before the protection switching can be completed there 
will be some data loss. 

Preferably, some intelligence is employed in the 
failure predictor 22 rather than simply using a hard threshold 
IBr, so as to distinguish between a scenario in which the BER 
is temporarily increased, but a failure is not about to occur. 
For example, the failure predictor could require that in 
addition to the latest BER exceeding the threshold IBr, a rate 
of increase (for example between two consecutive measurements) 
must also exceed some value indicating that a failure is likely 
with the assumption that a slow increase is less likely to be 
indicative of an immanent failure. A first or higher order 
derivative approximation might alternatively be employed. 

In another example, two predictive thresholds (both 
below the failure threshold) may be used, and if the two 
thresholds are crossed in a short enough period of time, then 
the decision to instigate protection switching is made. 

In another embodiment of this invention, during the 
restoration period, the new path is created, data is sent on 
both paths, and the receiver switches to data arriving on the 
new path before the first path stops sending data. This 
ensures there is no gap in the data. In this case the timeline 
would be as follows : 

TO: network is functioning without problem 

Tl: network failure starts, input BER starts to 
increase. FEC corrects errors. 
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T2: input BER reaches IBr and/or failure predictor 
decides failure likely to occur. Restoration starts. FEC 
continues to correct errors. 

T3: Service still using first path. FEC continues to 
correct errors on the first path. The new path is up and 
is passing data. 

T4: service starts using the new path. FEC continues 
to correct errors on the first path. 

T5: input BER on first path reaches Ibf, and first 
path is dropped. 

The above embodiment of the invention has focussed on 
the protection switching of paths by predicting failure based 
on BER. This involves layer 2 (path maintenance) in the 
protocol stack making use of layer 1 data (BER) which is 
somewhat of a break from convention. Preferably, the BER 
information is forwarded to the source for each path, and the 
source makes the decision as to whether or not to re-route. 

In another embodiment of the invention, the approach 
is used to perform layer 1 protection switching. For this 
embodiment, a specific light path (within a path) may have a 
primary light path and a protection light path. The scheme is 
then used to switch from the primary light path to the 
protection light path. This involves determinig a BER on a per 
light path basis. 

In another embodiment, for either layer 1 protection 
switching or layer 2 protection switching, preferably high 
priority traffic is switched to the protection path/light path 
prior to low priority traffic. Advantageously, high priority 
traffic is more likely to get completely switched before a 
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failure occurs when this technique is employed. An example of 
this is shown in Figure 5 where it is assumed that Xi and X2 are 
lower priority than X^, and that switchover has started with the 
result that ^.3 has been switched over to the protection path 14 
prior to the two lower priority wavelengths. 

In another embodiment of the invention, the raw BER 
determinations are used in making traffic engineering 
decisions, typically a layer 3 function. In this embodiment, 
raw BER information is maintained on an ongoing basis for 
available paths and/or links through a network. Then, in 
addition to any other constraints which might be employed in 
making routing decisions, the BER information is also 
considered. In such a manner, paths/light paths with good raw 
BER information are favoured over paths /light paths with poor 
raw BER information. This might be implemented by including 
the BER in the administrative cost for example. 

An example of this is shown in Figure 6. Shown is a 
network generally indicated by 50, and a routing system 
generally indicated by 52, typically implemented as part of 
network 50. The routing system decides how to route new 
requests for connections from source to destination in the 
network 50. On an ongoing basis, the network 50 provides raw 
BER information 54 to the routing system 52. This might be 
done on a path or a light path basis. The routing system 52 
then considers the BER information thus collected in the 
determination of new routes. The determination of the routes 
may involve the determination of a primary and a protection 
path for the connection. The routing system 52 may employ well 
known mult i- constraint algorithms to which the additional 
parameter of raw BER are added. 
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In a modification to the above -described embodiments 
of the invention, the failure predictor block 22 can process 
the quality measures from the quality determination block 20 by 
using a predictive mechanism based on defining derivatives of 
BERs. The block 20 measures an initial BERo value when the 
system is set up, and stores the value in a memory of the block 
22. The control circuit of the block 20 continues to measure 
BER at periodic time intervals (i.e. ... tn-2, tn-i, tn ...) , stores 
the corresponding values of BERs {BERn-2, BERn-i, BERn, ... ) in 
the memory, and sends the stored value to the failure predictor 
block 22. At each time interval (tn) the control circuit of the 
predictive block 22 calculates a deviation of the current BER 
from the initial BERo: 

n-l = BERn-i - BERo (1) 

n = BERn - BERo (2) 

The control circuit also calculates a speed of BER change by 
taking a derivative from the BER deviations and predicts a BERn+i 
at the next time interval (tn+i) : 

D ^^^LlA^ (3) 

BER„,,=A„-^D„^(t^-t„_,) (4) 
The predicted BERn+i is compared with a predetermined threshold 
value. If the amplitude of the predicted BERn+i is greater than 
the predetermined threshold value, the control circuitry of the 
predictor block 22 generates a signal to switch to a protection 
path through the network. In other variations, at least one 
second or higher order derivative may be used. 

Numerous modifications and variations of the present 
invention are possible in light of the above teachings. It is 
therefore to be understood that within the scope of the 
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appended claims, the invention may be practiced otherwise than 
as specifically described herein. 

For example, while the invention has been described 
in the context of a WDM system, more generally it may be 
employed in networks subject to failure. 

For example, in another embodiment, the raw quality 
measure for a path or a light path is determined as a function 
of one or more raw quality measures for light sections making 
up the path or light path. This involves making a raw quality 
measure in the optical domain. 



